NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

STZ: A High Quality and High Speed Streaming Lossy Compression Framework for Scientific Data

https://doi.org/10.1145/3712285.3759795

Wang, Daoce; Grosset, Pascal; Pulido, Jesus; Tian, Jiannan; Athawale, Tushar; Jia, Jinda; Sun, Baixi; Zhang, Boyuan; Jin, Sian; Zhao, Kai; et al (November 2025, ACM/IEEE)

Free, publicly-accessible full text available November 15, 2026
COMPSO: Optimizing Gradient Compression for Distributed Training with Second-Order Optimizers

https://doi.org/10.1145/3710848.3710852

Sun, Baixi; Liu, Weijin; Pauloski, J Gregory; Tian, Jiannan; Jia, Jinda; Wang, Daoce; Zhang, Boyuan; Zheng, Mingkai; Di, Sheng; Jin, Sian; et al (February 2025, ACM)

Free, publicly-accessible full text available February 28, 2026
A High-Quality Workflow for Multi-Resolution Scientific Data Reduction and Visualization

https://doi.org/10.1109/SC41406.2024.00091

Wang, Daoce; Grosset, Pascal; Pulido, Jesus; Athawale, Tushar M; Tian, Jiannan; Zhao, Kai; Lukić, Zarija; Huebl, Axel; Wang, Zhe; Ahrens, James; et al (November 2024, IEEE)

Full Text Available
Accelerating Communication in Deep Learning Recommendation Model Training with Dual-Level Adaptive Lossy Compression

https://doi.org/10.1109/SC41406.2024.00095

Feng, Hao; Zhang, Boyuan; Ye, Fanjiang; Si, Min; Chu, Ching-Hsiang; Tian, Jiannan; Yin, Chunxing; Deng, Summer; Hao, Yuchen; Balaji, Pavan; et al (November 2024, IEEE)

Full Text Available
CUSZ-i: High-Ratio Scientific Lossy Compression on GPUs with Optimized Multi-Level Interpolation

https://doi.org/10.1109/SC41406.2024.00019

Liu, Jinyang; Tian, Jiannan; Wu, Shixun; Di, Sheng; Zhang, Boyuan; Underwood, Robert; Huang, Yafan; Huang, Jiajun; Zhao, Kai; Li, Guanpeng; et al (November 2024, IEEE)

Full Text Available
A Survey on Error-Bounded Lossy Compression for Scientific Datasets

https://doi.org/10.1145/3733104

Di, Sheng; Liu, Jinyang; Zhao, Kai; Liang, Xin; Underwood, Robert; Zhang, Zhaorui; Shah, Milan; Huang, Yafan; Huang, Jiajun; Yu, Xiaodong; et al (May 2025, ACM Computing Surveys)

Error-bounded lossy compression has been effective in significantly reducing the data storage/transfer burden while preserving the reconstructed data fidelity very well. Many error-bounded lossy compressors have been developed for a wide range of parallel and distributed use cases for years. They are designed with distinct compression models and principles, such that each of them features particular pros and cons. In this paper we provide a comprehensive survey of emerging error-bounded lossy compression techniques. The key contribution is fourfold. (1) We summarize a novel taxonomy of lossy compression into 6 classic models. (2) We provide a comprehensive survey of 10 commonly used compression components/modules. (3) We summarized pros and cons of 47 state-of-the-art lossy compressors and present how state-of-the-art compressors are designed based on different compression techniques. (4) We discuss how customized compressors are designed for specific scientific applications and use-cases. We believe this survey is useful to multiple communities including scientific applications, high-performance computing, lossy compression, and big data.
more » « less
Free, publicly-accessible full text available May 2, 2026
Multifacets of lossy compression for scientific data in the Joint-Laboratory of Extreme Scale Computing

https://doi.org/10.1016/j.future.2024.05.022

Cappello, Franck; Acosta, Mario; Agullo, Emmanuel; Anzt, Hartwig; Calhoun, Jon; Di, Sheng; Giraud, Luc; Grützmacher, Thomas; Jin, Sian; Sano, Kentaro; et al (February 2025, Future Generation Computer Systems)

Free, publicly-accessible full text available February 1, 2026
FCBench: Cross-Domain Benchmarking of Lossless Compression for Floating-Point Data

https://doi.org/10.14778/3648160.3648180

Chen, Xinyu; Tian, Jiannan; Beaver, Ian; Freeman, Cynthia; Yan, Yan; Wang, Jianguo; Tao, Dingwen (February 2024, Proceedings of the VLDB Endowment)

While both the database and high-performance computing (HPC) communities utilize lossless compression methods to minimize floating-point data size, a disconnect persists between them. Each community designs and assesses methods in a domain-specific manner, making it unclear if HPC compression techniques can benefit database applications or vice versa. With the HPC community increasingly leaning towards in-situ analysis and visualization, more floating-point data from scientific simulations are being stored in databases like Key-Value Stores and queried using in-memory retrieval paradigms. This trend underscores the urgent need for a collective study of these compression methods' strengths and limitations, not only based on their performance in compressing data from various domains but also on their runtime characteristics. Our study extensively evaluates the performance of eight CPU-based and five GPU-based compression methods developed by both communities, using 33 real-world datasets assembled in the Floating-point Compressor Benchmark (FCBench). Additionally, we utilize the roofline model to profile their runtime bottlenecks. Our goal is to offer insights into these compression methods that could assist researchers in selecting existing methods or developing new ones for integrated database and HPC applications.
more » « less
Full Text Available
GPULZ: Optimizing LZSS Lossless Compression for Multi-byte Data on Modern GPUs

https://doi.org/10.1145/3577193.3593706

Zhang, Boyuan; Tian, Jiannan; Di, Sheng; Yu, Xiaodong; Swany, Martin; Tao, Dingwen; Cappello, Franck (June 2023, ICS '23: Proceedings of the 37th International Conference on Supercomputing)

Full Text Available
GPULZ: Optimizing LZSS Lossless Compression for Multi-byte Data on Modern GPUs

Zhang, Boyuan; Tian, Jiannan; Di, Sheng; Yu, Xiaodong; Swany, Martin; Tao, Dingwen; Cappello, Franck (June 2023, The 37th ACM International Conference on Supercomputing (ICS 2023))

Full Text Available

« Prev Next »

Search for: All records